Exploiting unlabeled data and lexical/ ontological structure for frame-semantic parsing
نویسنده
چکیده
Semantic parsing is the challenge of taking a sentence as input and generating a structured output, ideally a lexically and syntactically neutral representation of the semantics of the sentence. One common computational representation of semantics uses predicate logic; another makes use of predicate-argument structures defined with respect to some lexicon. The latter formulation of the problem is addressed here. English FrameNet (http://framenet.icsi.berkeley.edu) is a lexical resource prepared in the course of a major linguistic annotation project at Berkeley. FrameNet encompasses three forms of data: • A structured lexicon of frames, which are complex concepts (typically events); associated lexical units, linguistic words and phrases that evoke a particular frame; and frame elements or roles, which constitute the participants or attributes constituting the internal structure of a frame. Frames and their roles exist in an inheritance hierarchy and are marked as related to each other in several different ways (similar to entries in WordNet). For the most part, the frames which were selected for inclusion in the current version of FrameNet are frequent and domain-general in English. • Hand-selected exemplar sentences from a corpus which illustrate a particular use of a frame; these are annotated to indicate the frame-evoking word and any arguments to its roles. These are generally annotated for just a single frame, though they may also involve concepts represented by other
منابع مشابه
Semi-Supervised Frame-Semantic Parsing for Unknown Predicates
We describe a new approach to disambiguating semantic frames evoked by lexical predicates previously unseen in a lexicon or annotated data. Our approach makes use of large amounts of unlabeled data in a graph-based semi-supervised learning framework. We construct a large graph where vertices correspond to potential predicates and use label propagation to learn possible semantic frames for new o...
متن کاملSemi-Supervised and Latent-Variable Models of Natural Language Semantics
This thesis focuses on robust analysis of natural language semantics. A primary bottleneck for semantic processing of text lies in the scarcity of high-quality and large amounts of annotated data that provide complete information about the semantic structure of natural language expressions. In this dissertation, we study statistical models tailored to solve problems in computational semantics, ...
متن کاملFeature Engineering in Persian Dependency Parser
Dependency parser is one of the most important fundamental tools in the natural language processing, which extracts structure of sentences and determines the relations between words based on the dependency grammar. The dependency parser is proper for free order languages, such as Persian. In this paper, data-driven dependency parser has been developed with the help of phrase-structure parser fo...
متن کاملStructured Probabilistic Models for Natural Language Semantics
The recent past has witnessed a predominance of robust empirical methods in natural language structure prediction, but mostly through the analysis of syntax. Wide-coverage analysis of the underlying meaning or semantics of natural language utterances still remains a major obstacle for language understanding. A primary bottleneck lies in the scarcity of high-quality and large amounts of annotate...
متن کاملFrame Semantic Parsing using Framester Knowledge Graphs
This paper introduces TakeFive, a new algorithm that performs frame semantic parsing using frame-oriented knowledge graph generated by Framester. TakeFive performs dependency parsing, identifies the words that evoke lexical frames, locates the roles and fillers for each frame, and runs coercion techniques.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009